Generalized Distillation Framework for Speaker Normalization

نویسندگان

  • Neethu Mariam Joy
  • Sandeep Reddy Kothinti
  • S. Umesh
  • Basil Abraham
چکیده

Generalized distillation framework has been shown to be effective in speech enhancement in the past. We extend this idea to speaker normalization without any explicit adaptation data in this paper. In the generalized distillation framework, we assume the presence of some “privileged” information to guide the training process in addition to the training data. In the proposed approach, the privileged information is obtained from a “teacher” model, trained on speaker-normalized FMLLR features. The “student” model is trained on un-normalized filterbank features and uses teacher’s supervision for cross-entropy training. The proposed distillation method does not need first pass decode information during testing and imposes no constraints on the duration of the test data for computing speakerspecific transforms unlike in FMLLR or i-vector. Experiments done on Switchboard and AMI corpus show that the generalized distillation framework shows improvement over un-normalized features with or without i-vectors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Bayesian Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification

The purpose of this paper is to unify several of the state-of-the-art score normalization techniques applied to text-independent speaker verification systems. We propose a new Bayesian framework for this purpose. The two well-known Zand T-normalization techniques can be easily interpreted in this framework as different ways to estimate score distributions. This is useful as it helps to understa...

متن کامل

Segmental Normalization for Robust Speaker Verification

For the task of speaker verification, similarity measure normalization methods are relevant to cope with variability problems and with data and/or decision fusion problems. The aim of this paper is to suggest a new method of normalization which combines classical world model based normalization techniques with ones based on a posteriori probability. This original method presents the well-known ...

متن کامل

Structural framework for combining speaker recognition methods

The paper describes a structural framework for the design of a speaker recognition system based on multiple models. This combination is not only at the recognition level, but also at a joint training of the models. This unified training of the models uses a common structure : a decomposition tree of the set of data of normalization speakers. For the experiments, the Gaussian Mixture Model and t...

متن کامل

Generalized Modular Framework for Distillation Column Synthesis

In this work the distillation column sequencing problem is addressed through the Generalized Modular Framework, based on formal superstructure optimization techniques. The proposed method overcomes structural complexities through the use of systematically composed structural models incorporating all the feasible sequencing alternatives. The generated sequences are evaluated with respect to thei...

متن کامل

Experiments on speaker profile portability

This paper addresses the problem of speaker characterization in the speaker-dependent speech recognition problem. Speaker Adaptation and Normalization techniques are designed to reduce the mismatch introduced by inter-speaker variability. Yet there is another source of mismatch introduced by intra-speaker variability. Indeed, the speaking style of a speaker depends on the nature of the speech u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017